Bidirectional Attention Flow for Machine Comprehension

Authors

  • Min Joon Seo
  • Aniruddha Kembhavi
  • Ali Farhadi
  • Hannaneh Hajishirzi
Abstract

Machine comprehension (MC), answering a query about a given context paragraph, requires modeling complex interactions between the context and the query. Recently, attention mechanisms have been successfully extended to MC. Typically, these methods use attention to focus on a small portion of the context and summarize it with a fixed-size vector, couple attentions temporally, and/or form a uni-directional attention. In this paper, we introduce the Bi-Directional Attention Flow (BIDAF) network, a multi-stage hierarchical process that represents the context at different levels of granularity and uses a bidirectional attention flow mechanism to obtain a query-aware context representation without early summarization. Our experimental evaluations show that our model achieves state-of-the-art results on the Stanford Question Answering Dataset (SQuAD) and the CNN/DailyMail cloze test.
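
The abstract does not spell out how attention is computed, so the following is only a rough sketch of what attending in both directions without early summarization could look like. It assumes a plain dot-product similarity between context and query vectors (an illustrative stand-in, not necessarily the similarity function used in the paper), and all function names are hypothetical. The point it illustrates is that every context position keeps its own query-aware vector instead of the context being collapsed into a single fixed-size summary.

```python
import math


def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def weighted_sum(weights, vectors):
    # Weighted average of equally sized vectors.
    dim = len(vectors[0])
    return [sum(w * v[i] for w, v in zip(weights, vectors)) for i in range(dim)]


def bidirectional_attention(context, query):
    """Return one query-aware vector per context position.

    ASSUMPTION: similarity is a plain dot product; a trained model would
    normally use a learned similarity function instead.
    """
    # Similarity between every context position and every query position.
    sim = [[dot(h, u) for u in query] for h in context]

    # Context-to-query: for each context word, attend over the query words.
    c2q = [weighted_sum(softmax(row), query) for row in sim]

    # Query-to-context: weight context words by their best match to any
    # query word, giving one attended context vector shared by all positions.
    q2c = weighted_sum(softmax([max(row) for row in sim]), context)

    # Merge per position; the context is never reduced to a single vector.
    merged = []
    for h, u_att in zip(context, c2q):
        merged.append(
            h
            + u_att
            + [a * b for a, b in zip(h, u_att)]
            + [a * b for a, b in zip(h, q2c)]
        )
    return merged


# Toy inputs: 3 context positions and 2 query positions, dimension 2.
context = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [[1.0, 0.0], [0.0, 1.0]]
G = bidirectional_attention(context, query)
print(len(G), len(G[0]))
```

In this toy setup the output has the same number of rows as the context, and each row is four times the input dimension because it concatenates the original vector, its attended query vector, and two element-wise products.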

Similar resources

Start and End Interactions in Bidirectional Attention Flow for Reading Comprehension

The reading comprehension machine learning task involves reading in a question and returning an answer from an associated context paragraph. This task has proven to be difficult, as the performance of state-of-the-art models still does not compare with human performance. The difficulty of the task comes from understanding two separate pieces of information as well as the relationship between the...

Ensemble Learning For Machine Comprehension: Bidirectional Attention Flow Models

In this paper, we explore machine comprehension on the Stanford Question Answering Dataset using ensembled deep recurrent neural networks with bi-directional attention flow. Given a context paragraph, we attempt to answer a query related to the context paragraph. This requires us to not only generate knowledge representation for each question and paragraph, but also create mechanisms that...

CS224n Assignment 4: Machine Comprehension with Exploration on Attention Mechanism

The goal of this paper is to perform the reading comprehension prediction task on the SQuAD dataset. Given a context paragraph and a question, we output an answer. To do this, a model is built combining the ideas of a bidirectional LSTM and an attention flow mechanism. The basic architecture and setup details of the model are introduced, as are the summary of performance and error analys...

Attention-based Recurrent Neural Networks for Question Answering

Machine Comprehension (MC) of text is an important problem in Natural Language Processing (NLP) research, and the task of Question Answering (QA) is a major way of assessing MC outcomes. One QA dataset that has gained immense popularity recently is the Stanford Question Answering Dataset (SQuAD). Successful models for SQuAD have all involved the use of Recurrent Neural Network (RNN), and most o...

A Convolutional Network Approach to Machine Comprehension

Machine Comprehension is a daunting task, since it requires cross-encoding and exchanging information between a context paragraph and a given query in order to produce an answer span. In designing baselines for a machine comprehension model, each model training has a long turnover, which does not bode well when there is limited time to train. Long runtimes are often from implementing recurrent ...

Journal:
  • CoRR

Volume: abs/1611.01603

Publication year: 2016